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Abstract 

With reference to the questionnaire adopted within the Italian project 
"Ulisse" to assess health condition of elderly people, we investigate two im- 
portant issues: discriminant power and actual number of dimensions meas- 
ured by the items composing the questionnaire. The adopted statistical 
approach is based on the joint use of the latent class model and a multidi- 
mensional item response theory model based on the 2PL parametrization. 
The latter allows us to account for the different discriminant power of these 
items. The analysis is based on the data collected on a sample of 1699 
elderly people hosted in 37 nursing homes in Italy. This analysis shows 
that the selected items indeed measure a different number of dimensions of 
the health status and that they considerably differ in terms of discriminant 
power (effectiveness in measuring the actual health status). Implications 
for the assessment of the performance of nursing homes from a policy-maker 
prospective are discussed. 

Keywords: Latent class model, item response theory models, performance 
evaluation. 
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1 Introduction 



The progressive ageing of the contemporary society, due to the increasing life 
expectation, has raised the demand for health assistance, stimulating the debate 
on the quality of the care provided by nursing homes. According to the OECD, the 
percentage of elderly people in the population of the industrialized countries will 
increase from 13% of 2000 to 25% of 2040. As a consequence, the cost of health 
care, especially for long term care assistance, will rapidly increase. Anderson and 
Hussey (2000), considering the eight most industrialized OECD countries, found 
that the cost for care assistance of elderly people increased from 3% to 5% of the 
GDP between 1993 and 1999 and the cost for long term health assistance increased 
from 0.9% and 1.6% in the same period. In Italy, public spending for long term 
assistance in 2002 was, in terms of GDP, about 0.7%, which is expected to strongly 
increase in the next decade. 

A direct consequence of the increasing cost for health assistance is that the 
Governments of industrialized countries have begun to consider the problem of 
the rationalization of public interventions, in terms of public spending, regulation, 
and policies. Public intervention is also crucial to guarantee the accessibility to 
care facilities of elderly people with low income, without affecting the economic 
status of their families. Since the access to long term care assistance is conditioned 
on the possibility to pay for the services provided by nursing homes, there could 
be an artificial low level of demand for long term care assistance; see Alecxih and 
Kennell (1994). 

The above arguments imply that methodological tools to analyze the perform- 
ance of facilities for care assistance of elderly people are of central interest for 
policy-makers. The aim is twofold: (i) to promote the rise of the quality of exist- 
ing institutions (especially nursing homes) that have to satisfy standard criteria 
for care assistance and (ii) to implement a strategy to reform the role of pub- 
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lie institutions in this context. At this regard, a great debate arises about the 
construction of indicators able to measure nursing home performance and then to 
effectively rank the facilities in a certain geographical region; see Phillips et al. 
(2007) and Harrington et al. (2003). In the United States, one of the most import- 
ant projects of nursing home facility ranking is "Medicare", which is supported 
by the Department of Health and Human Services. This project has developed 
an evaluation system which is based on indicators that are generally linked to 
the psycho-physics condition of elderly people which are related, in particular, to 
mobility diseases, behavioral disorders, and memory problems. Data are collected 
by public institutions through questionnaires administered at regularly repeated 
occasions. 

One of the main ideas behind the facility ranking is that unidimensional criteria 
to classify the health conditions of elderly people are available; see Phillips et al. 
(2007) and Mor et al. (2003). This assumption implies that the difference between 
two subjects in responding to a set of items on the health condition only depends 
on a single latent trait summarizing this condition. This may obviously be a 
restrictive assumption. For instance, this does not allow us to classify subjects 
who show a degenerative health status in relation to a specific pathology, but 
apparently have an overall good health status. Violations of this assumption may 
lead to misleading conclusions reached on the basis of a unidimensional ranking 
which relies on a single score assigned to each facility. 

On the other hand, Kane (1998) identified at list five different groups of res- 
idents of nursing homes. These groups include subjects recovering from an acute 
episode and are likely to return home, residents who are cognitive impaired, resid- 
ents who are cognitively intact but suffer from physical challenges, residents who 
are in vegetative state, and residents that are terminally ill. These subjects have 
different needs and different levels of the quality of life; see Kane (1981). Con- 
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sequently, the actions of the nursing homes could be more forceful in relation to 
specific areas of intervention, so that some facilities can be specialized in improving 
the state of health in relation to some pathologies. Along these lines, dimension- 
ality of health condition becomes a central issue for obtaining a consistent and 
generalizable ranking index for the performance of the nursing homes. 

In the present paper, we simultaneously address the issue of item selection and 
of dimensionality on the basis of a formal statistical procedure (Bartolucci, 2007), 
which exploits the latent class (LC) model and a class of item response theory (IRT) 
models; see Lazarsfeld and Henry (1968), Goodman (1974, 1978), and Hambleton 
(1996). Through these methodologies, we study the above issues on the basis of 
a dataset coming from the database "Ulisse", which is collected within a survey 
carried out in Italy on the basis of the RAI-MDS questionnaire (Morris, 1997). The 
questionnaire covers several aspects of the health status of elderly people hosted in 
nursing homes. In particular, we consider 89 among the around 300 available items. 
These items characterize: (i) cognitive conditions, (ii) auditory and view fields, 
(Hi) humor and behavioral disorders, (iv) activities of daily living, (v) incontinence, 
(vi) nutritional field, (vii) dental disorders, and (viii) skin conditions. By using 
a so large number of items we can fully characterize the health status of elderly 
people hosted in the nursing homes, without imposing any a-priori restriction on 
the relevance of its different components. From the original set of 89 items we then 
extract a subset of 35 items on the basis of their discriminant power, that is the 
effectiveness in measuring these conditions. The adopted methodology is based 
on the joint use of LC and IRT models and represents a useful tool to reduce the 
size of the present and similar questionnaires, with the obvious consequence of 
reducing the survey costs. 

On the basis of the applied methodology we show that the 35 selected items 
indeed measure five different dimensions which may be referred to as: (i) cognitive 
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conditions, (ii) auditory and view fields, (in) activities of daily living and incontin- 
ence, (iv) humor and behavioral disorders and skin conditions, and (v) nutritional 
field and dental disorders. These dimensions have a clear interpretation; this seems 
to confirm the robustness of the proposed analysis. 

The reminder of the paper is organized as follows. In Section 2 we describe 
the dataset on health conditions of elderly people hosted in certain of nursing 
homes across the Italian regions. In Section 3 we briefly review the statistical 
methodology based on LC and IRT models. In Section 4 we describe in detail the 
empirical analysis and in Section 5 we report the main conclusions of the study. 

2 The Ulisse database 

We consider a dataset collected within the "Ulisse" project, which is carried out 
by the Italian Ministry of Health jointly with the Italian Society of Gerontology 
and Geriatrics. The project is based on a longitudinal survey, covering 17 Italian 
regions, about the assistance level provided to patients hosted in 37 randomly 
chosen nursing homes. This survey is carried out since 2004 through the repeated 
administration of a questionnaire (every 6 months) which is filled up by the nursing 
assistant of each patient and concerns several aspects of the everyday life. For our 
analysis we consider only the first interview, which covers 1699 patients. 

Table 1 reports the geographical distribution, on the Italian territory, of the 
elderly people and the nursing homes included in the study. We observe that most 
of the sample is in the north regions: the percentage of patients in these regions is 
about 85%. In particular, 40% of the full sample is located in Lombardia (17%), 
Veneto (11%), and Emilia Romagna (12%). In Table 2 we report some descriptive 
statistics on gender and age of patients of the nursing homes. 

We observe that most of the sample is composed by women (71%), with only 
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Table 1: Regional distribution of the elderly people and nursing homes included in 
the study. 

Number of Numbers of 



Region subjects Percentage nursing homes 



North 


Piemonte 


105 


6.18 




2 




Lombardia 


292 


17.19 




6 




Trcntino A. A. 


74 


4.36 




1 




Vcneto 


194 


11.42 




5 




Friuli V. G. 


92 


5.41 




1 




I , i fri in o 
1.J1H, HI la 


133 


7.83 




2 




Emilia R. 


214 


12.60 




4 


Center 


Tuscany 


83 


4.89 




3 




Umbria 


142 


8.36 




3 




Marchc 


48 


2.83 




1 


South 


Abruzzo 


78 


4.59 




2 




Molise 


46 


2.71 




2 




Campania 


52 


3.06 




1 




Puglia 


51 


3.00 




1 




Calabria 


42 


2.47 




1 


Hands 


Sicilia 


31 


1.82 




1 




Sardcgna 


22 


1.29 




1 




Total 


1,699 


100.00 




37 




Table 2: 


Nursing homes population by age and gender. 






Age 






Male 


Female 


Total 


> 70 






4.72 


3.24 


7.96 


70 - 75 






4.78 


5.37 


10.14 


75 - 80 






6.07 


11.85 


17.92 


80 - 85 






5.72 


15.74 


21.46 


85 - 90 






3.36 


15.68 


19.04 


> 90 






4.19 


19.28 


23.47 


Total 






28.83 


71.17 


100.00 



29% of men. Moreover, the age distribution differs in relation to gender, with a 
higher proportion of females with age 85 and over and a relative younger male 
population. The presence of a so high percentage of old women can obviously 
condition the analysis about the care facility performance. 

From the original questionnaire we single out 89 among the items concerning: 
(i) cognitive conditions, (it) auditory and view fields, (Hi) humor and behavioral 
disorders, (iv) activities of daily living, (v) incontinence, (vi) nutritional field, (vii) 
dental disorders, and (viii) skin conditions. The complete list of items is reported 
in Appendix 1. 
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3 The statistical methodology 

In the following, we briefly review the LC model (Goodman, 1974) and the IRT 
model proposed by Bartolucci (2007) for the study of multidimensionality. The 
aim of the methodology based on these models is: (i) to include in the analysis only 
the items necessary to identify the number of latent traits; (it) to identify the latent 
structure representing the health status of elderly people; (Hi) to investigate the 
different discriminant power of the items. This methodology allows us to choose a 
convenient partition of the selected items according to the dimension they measure 
and to analyze its correspondence in terms of facility care. 

3.1 The latent class model 

The LC model (Lazarsfeld and Henry, 1968; Goodman, 1974, 1978) assumes that 
the observed sample is drawn from a population which is partitioned into k latent 
classes, with ir c being the prior probability (or weight) of class c (c = 1, . . . , k). 
For each subject i (i — 1, . . . , n), we observe a vector y i = (yn, ....,yij) of binary 
response variables corresponding to J items. Given that the subject is in class c 
and with reference to item j, the conditional probability of success is denoted by 
Aj| c . 

Under the assumption of local independence, the probability of the response 
pattern y { is 

p(Vi) = IX^kK, 

c 

Mc) = HA^d A,-,.)' ««. 

3 

The log-likelihood function of the LC model, which is used for parameter estima- 
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tion, is then 

m = ^o gP { yi ) (i) 

i 

where 6 is a short-hand notation for all model parameters. This function is max- 
imized by the EM algorithm (Dempster et al., 1977; Goodman, 1978). This is an 
iterative algorithm which is based on two steps to be repeated until convergence: 

• E-step: compute the conditional expected value of the log-likelihood given 
the observed data and the current value of the parameters; 

• M-step: update the model parameters by maximizing the expected log- 
likelihood obtained at the E-step. 

We initialize the algorithm by both deterministic and random starting values in 
order to prevent the problem of multimodality of the likelihood. This is a typical 
problem of latent variable models. 

Obviously, when the LC model is applied to analyze a dataset, choosing the 
number of latent classes is necessary. At this aim, we rely on the Bayesian Inform- 
ation Criterion (BIC) of Schwarz (1978), which is based on the index: 

BIC k = -24 + m fc log(n), 

where, for a given number of classes k, is the maximum of the log-likelihood 
given in (1) and is the corresponding number of parameters. The latter is 
taken as a measure of complexity of the model on which the above penalization 
term is based. According to this criterion, the number of classes corresponding to 
the minimum of BICk has to be selected. This number is indicated by k. 

Through the EM algorithm we obtain, for each latent class c, the maximum 
likelihood estimate of the weight, denoted by n c , and of the conditional probabil- 
ities of success, denoted by Aj| c , j — 1, . . . , J. On the basis of the latter ones we 

8 



can obtain a measure of the discriminant power of the items measuring each di- 
mension. In our study, each dimension corresponds to a group of items measuring 
a different aspect of the health status of elderly people. In particular, we measure 
the discriminant power of item j by the following ratio: 



denominator of (2) stands for the maximum of the difference at the numerator 
with respect to all the items measuring the same dimension of item j. In this way, 
the above index is always between and 1 and we exclude from the analysis those 
items which have an index value lower than a given threshold because they show 
a reduced discriminant power. This approach for reducing the number of items is 
compared with the one based on the IRT model that we present in the following 
section. 

3.2 The multidimensional two-parameter logistic model 

This model is a constrained version of the LC model which directly includes, for 
each item, a parameter measuring its discriminant power. The model is based on 
the following multidimensional two-parameter logistic (2PL) parametrization of 
the conditional probabilities of success: 



1 max h(j -)(M h )' 
Mj = max(Aj| c ) — min(Aj| c ), 



(2) 





j = 1, . . . , J. 



(3) 
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In the above expression, 8^ is a dummy variable equal to 1 if item j measures 
dimension d (d — 1, . . . , s) and to otherwise. Moreover, 9 c d is a measure of the 
latent trait (dimension d) for the subjects in latent class c (typically referred to as 
ability), (3j is a measure of the overall tendency to respond to item j (typically 
referred to as difficulty), and 7j measures the discriminant power of this item 
(typically referred to as discriminant index). 

The log-likelihood function of the model may again be expressed as: 

£(e) = J2\ogJ2p(y t \c)7r c ; 

i c 

maximization of 1(0) is again performed by the EM algorithm of Dempster et al. 
(1977), which has a the same structure outlined in the previous section. 

From the EM algorithm we also obtain, for each item j, the estimate 7j of the 
discriminant index. In order to select a suitable set of items on the basis of these 
estimates, we rely on the ratio 



j_y ■ — , i £± i 

3 max Mj) (7 fe ) 

This index has a structure similar to that in (2), with the difference Mj between the 
maximum and the minimum of the estimated response probabilities substituted 
by the estimated discriminant index. Then, items with a value of D* lower than a 
certain threshold are dropped. 

Another analysis that is allowed by the 2PL model presented above is that 
of dimensionality. In particular, through this model we can test the hypothesis, 
indicated in the following by H , that the items actually measure a reduced number 
of dimensions. For instance, we can test the hypothesis that the items measure 
s — 1 instead of s dimensions. The s — 1 dimensions are specified by collapsing two 
dimensions into one and then grouping the corresponding items. 
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In order to test H , we use the likelihood ratio statistic 



Li? = 2 ]>>(</) log 
v 

where the sum is extended to all the possible response configurations y, n(y) stands 
for the observed frequency of configuration y, p(y) is estimated probability of this 
configuration under the model with s dimensions, and po(y) is the corresponding 
estimate under the reduced model with s — 1 dimensions. Under H , this statistic 
has a x 2 asymptotic distribution with a number of degrees of freedom equal to 
the k — 2, where k is the adopted number of latent classes. Then, the hypothesis 
is rejected if the observed value of Li? is larger than a suitable percentile of this 
distribution. 

On the basis of the above testing procedure, we can implement a hierarchical 
algorithm for clustering items into a reduced number of groups. Items in the 
same group are supposed to measure the same dimension and then each group 
corresponds to a different dimension. In particular, the clustering algorithm begins 
by fitting the model in which items are grouped according the structure of the 
questionnaire. Then, all the possible ways to collapse two groups are considered 
and the one giving the smallest value of Li? with respect to the previous step 
is selected. This procedure is repeated until the unidimensional IRT model (in 
which all items are included in the same group) is fitted. Finally, the selected 
number of groups (and then of dimensions) is the smallest number for which the 
value of LR, computed with respect to the previous classification, is smaller than 
a suitable percentile of the asymptotic distribution. For a detailed illustration of 
the algorithm see Bartolucci (2007). We perform this classification once the items 
with a reduced discriminating power are eliminated as indicated above. 



p(y) 

Po(y) 
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4 Application to the Ulisse dataset 



Following the empirical strategy outlined in the previous section, we analyze the 
Ulisse dataset described in Section 2. We first present the results from the LC 
model and then those from the IRT model. 

4.1 Latent class analysis 

As starting point we choose the number of latent classes for the LC model applied 
to the 89 selected items. At this aim, Table 3 reports the maximum log-likelihood 
and the corresponding value of the BIC index for a number of latent classes from 
1 to 7. 

Table 3: Selection of the number of latent classes for the LC model. For each 
number of classes from 1 to 7, 4 is the maximum log-likelihood of the model, nik 
is the number of parameters, and BICk is the corresponding value of the BIC 
index. In boldface are reported the quantities corresponding to the selected model. 



k 


4 


m k 


BIC k 


1 


-37175.939 


89 


75013.842 


2 


-32773.208 


179 


66877.781 


3 


-31444.523 


269 


64889.813 


4 


-30432.381 


359 


63534.930 


5 


-29875.393 


449 


63090.356 


6 


-29423.379 


539 


62855.730 


7 


-29159.367 


629 


62997.107 



On the basis of these results we select k = 6 latent classes. These classes 
correspond to different degrees of impairment of the elderly people health status. 
In order to interpret these classes, Figure 1 reports the graph of the conditional 
probabilities of success estimated under the selected LC model for each class. To 
have a clearer interpretation of these results, we order the six classes according to 
the probability of success for the first item. 
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Figure 1: Plot of the estimated conditional probabilities of success Xj\ c under the 
selected LC model. Each panel corresponds to a different latent class c, ordered 
according to the probability of success for the first item. 




The estimated conditional probabilities obtained from the LC model are then 
used to assess the discriminant power of the 89 selected items. At this aim, in 
Table 4 we report for each item j the value of the index Dj, computed according 
to (2), together with the weighted mean and standard deviation of the estimated 
success probabilities, computed with weights corresponding to the estimated class 
probabilities. 
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Table 4: Weighted mean and standard deviation of the estimated success probab- 
ilities \j\ c for each item j , together with the indices Mj and Dj used to measure 
the discriminant power. In boldface are the quantities referred to the item that for 
each dimension d has the highest discriminant power. 



j 


item 


d 


mean 


std 






j 


item 


d 


mean 


std 


Ma 


Dj 


1 


CC1 


1 


0.629 


0.107 


0.753 


0.760 


40 


ADL1 


4 


0.459 


0.243 


0.876 


1.000 


2 


CC2 


1 


0.577 


0.126 


0.807 


0.814 


41 


ADL2 


4 


0.481 


0.264 


0.869 


0.992 


;-j 


CC3 


1 


0.538 


0.177 


0.875 


0.883 


42 


ADL3 


4 


0.202 


0.289 


0.461 


0.526 


4 


CCA 


1 


0.404 


0.277 


0.991 


1.000 


43 


ADL4 


4 


0.198 


0.290 


0.440 


0.502 


5 


CC5 


1 


0.442 


0.226 


0.877 


0.885 


44 


ADL5 


4 


0.382 


0.241 


0.721 


0.823 


6 


CC6 


1 


0.398 


0.264 


0.921 


0.929 


45 


ADL6 


4 


0.355 


0.214 


0.618 


0.705 


7 


CC7 


1 


0.609 


0.142 


0.865 


0.873 


46 


ADL7 


4 


0.563 


0.215 


0.870 


0.993 


8 


CC8 


1 


0.542 


0.119 


0.827 


0.835 


47 


ADL8 


4 


0.246 


0.291 


0.753 


0.860 


9 


CC9 


1 


0.426 


0.159 


0.761 


0.768 


48 


ADL9 


4 


0.449 


0.224 


0.754 


0.861 


10 


CC10 


1 


0.410 


0.154 


0.711 


0.717 


49 


ADL10 


4 


0.618 


0.207 


0.857 


0.978 


11 


ecu 


1 


0.401 


0.126 


0.668 


0.674 


50 


ADL11 


4 


0.775 


0.030 


0.238 


0.272 


12 


CC12 


1 


0.316 


0.173 


0.533 


0.538 


51 


ADL12 


4 


0.442 


0.214 


0.830 


0.947 


13 


CC13 


1 


0.439 


0.086 


0.593 


0.598 


52 


ADL13 


4 


0.187 


0.279 


0.395 


0.451 


14 


CAS1 


2 


0.254 


0.205 


0.29b 


0.348 


53 


ADL14 


4 


0.299 


0.189 


0.524 


0.598 


15 


CAS2 


2 


0.277 


0.296 


0.847 


1.000 


54 


ADL15 


4 


0.224 


0.262 


0.480 


0.548 


16 


CAS3 


2 


0.341 


0.202 


0.704 


0.831 


55 


ADL16 


4 


0.538 


0.092 


0.652 


0.744 


17 


CAS4 


2 


0.302 


0.282 


0.843 


0.995 


56 


ADL17 


4 


0.401 


0.159 


0.611 


0.697 


18 


CAS5 


2 


0.331 


0.152 


0.465 


0.549 


57 


ADL18 


4 


0.183 


0.285 


0.378 


0.432 


iy 


HBU1 


3 


(J.220 


0.212 


0.598 


0.882 


58 


1 1 


5 


U.462 


U.245 


u.yiv 


l.UUU 


20 


HBD2 


3 


0.262 


0.195 


0.532 


0.785 


59 


12 


5 


0.512 


0.194 


0.821 


0.895 


21 


HBD3 


3 


0.251 


0.194 


0.482 


0.711 


60 


13 


5 


0.288 


0.135 


0.294 


0.321 


22 


HBDA 


3 


0.335 


0.150 


0.678 


1.000 


61 


14 


5 


0.392 


0.093 


0.353 


0.385 


23 


HBD5 


3 


0.182 


0.235 


0.506 


0.746 


62 


15 


5 


0.026 


0.440 


0.020 


0.022 


24 


HBD6 


3 


0.243 


0.187 


0.441 


0.650 


63 


16 


5 


0.022 


0.450 


0.061 


0.067 


25 


HBD7 


3 


0.126 


0.287 


0.308 


0.454 


64 


17 


5 


0.174 


0.261 


0.251 


0.274 


26 


HBD8 


3 


0.283 


0.161 


0.588 


0.867 


65 


18 


5 


0.019 


0.445 


0.038 


0.041 


27 


HBD9 


3 


0.295 


0.150 


0.555 


0.819 


66 


19 


5 


0.002 


0.474 


0.011 


0.012 


28 


HBD10 


3 


0.249 


0.185 


0.524 


0.773 


67 


110 


5 


0.068 


0.401 


0.200 


0.218 


29 


HBD11 


3 


0.292 


0.143 


0.409 


0.603 


68 


111 


5 


0.006 


0.465 


0.016 


0.017 


30 


HBD12 


3 


0.424 


0.088 


0.568 


0.838 


69 


112 


5 


0.026 


0.444 


0.054 


0.059 


31 


HBD13 


3 


0.209 


0.203 


0.393 


0.580 


70 


113 


5 


0.665 


0.068 


0.499 


0.544 


32 


HBD14 


3 


0.297 


0.150 


0.423 


0.624 


71 


114 


5 


0.197 


0.245 


0.281 


0.306 


33 


HBD15 


3 


0.443 


0.070 


0.502 


0.740 


72 


115 


5 


0.007 


0.464 


0.016 


0.017 


34 


HBD16 


3 


0.503 


0.057 


0.553 


0.816 


Y3 


Nl 


6 


0.29b 


U.14U 


U.3U4 


U.H10 


35 


HBD17 


3 


0.150 


0.273 


0.396 


0.584 


74 


JV2 


6 


0.121 


0.352 


0.334 


1.000 


36 


HBD18 


3 


0.264 


0.170 


0.481 


0.709 


75 


N3 


6 


0.014 


0.459 


0.030 


0.090 


37 


HBD19 


3 


0.132 


0.293 


0.245 


0.361 


76 


N4 


6 


0.087 


0.347 


0.105 


0.314 


38 


HBD20 


3 


0.204 


0.223 


0.322 


0.475 


77 


N5 


6 


0.026 


0.445 


0.051 


0.153 


39 


HBD21 


3 


0.217 


0.197 


0.396 


0.584 


78 


N6 


6 


0.088 


0.335 


0.182 


0.545 
















79 


N7 


6 


0.043 


0.412 


0.072 


0.216 
















80 


N8 


6 


0.203 


0.220 


0.329 


0.985 
















81 


Dl 


V 


0.094 


0.370 


U.225 


0.674 
















82 


D2 


7 


0.377 


0.081 


0.334 


1.000 
















83 


D3 


7 


0.409 


0.072 


0.238 


0.713 
















84 


D4 


7 


0.068 


0.383 


0.047 


0.141 
















85 


D5 


7 


0.013 


0.449 


0.049 


0.147 
















86 


D6 


7 


0.398 


0.073 


0.179 


0.536 
















87 


SKI 


8 


U.U88 


U.37U 


U.220 


l.UUU 
















88 


SK2 


8 


0.029 


0.441 


0.062 


0.282 
















89 


SK3 


8 


0.126 


0.317 


0.203 


0.923 



As described in Section 3.1, the index Dj may be used to select a subset of 
items which provide a similar amount of information than the full set of items. 
This is obtained by comparing the values of this index with a suitable threshold 
between and 1. In particular, for different threshold levels we report in Table 5 
the number of selected items for each dimension. For instance, with a threshold 
of 0.5 we retain in the analysis 63 items. 



14 



Table 5: Results from the item selection procedure based on the indices Dj in terms 
of number of items retained for each dimension. 



dimension 



threshold 


1 


2 


3 


4 


5 


6 


7 


8 


overall 


0.0 


13 


5 


21 


17 


16 


8 


6 


3 


89 


0.1 


13 


5 


21 


18 


8 


7 


6 


3 


81 


0.2 


13 


5 


21 


18 


8 


6 


4 


3 


78 


0.3 


13 


5 


21 


17 


6 


5 


4 


2 


73 


0.4 


13 


4 


20 


17 


3 


4 


4 


2 


67 


0.5 


13 


4 


18 


15 


3 


4 


4 


2 


63 


0.6 


11 


3 


15 


11 


2 


3 


3 


2 


50 


0.7 


10 


3 


12 


10 


2 


3 


2 


2 


44 


0.8 


7 


3 


6 


8 


2 


3 


1 


2 


32 


0.9 


2 


2 


1 


5 


1 


3 


1 


2 


17 


1.0 


1 


1 


1 


1 


1 


1 


1 


1 


1 



4.2 Item response theory analysis 

To complete the item selection analysis we exploit the alternative approach based 
on the 2PL model illustrated in Section 3.2. The same model is then used to 
evaluate the dimensionality of health condition of elderly people. The results of 
the analysis are finally compared with those presented in the previous section and 
based on the LC model. Since the adopted IRT model can be seen as a constrained 
version of the LC model, there are no compatibility problems in comparing the 
results of the two analyses. 

On the basis of the same number of classes selected above, k = 6, we obtain 
the estimates of the parameters in (3). In particular, for each item j we report in 
Table 6 the estimated difficulty level f3j and discriminant index jj, together with 
the index D* defined in (4). On the basis of the values of this index, we can select 
a suitable number of items. The results of this selection process are reported in 
Table 7 for different threshold levels between to 1 . 
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Table 6: Parameter estimates under the 2PL model. For each item j , $j is the 
estimated difficulty, jj is the estimated discriminant power, and D* is the relative 
discriminant power. In boldface are the quantities referred to the item that for each 
dimension d has the highest discriminant power. 





3 


item 


Ij 


Pi 


D* 
i 




3 


item 


„ 

1j 


5 

Pi 


D* 
i 


1 


1 


CC1 


1.000 


0.000 


0.706 


4 


40 


ADL1 


1.000 


0.000 


0.819 


1 


2 


CC2 


1.161 


0.492 


0.820 


4 


41 


ADL2 


1.107 


-0.142 


0.907 


1 


3 


CC3 


1.416 


0.875 


1.000 


4 


42 


ADL3 


0.681 


2.843 


0.558 


1 


4 


CC4 


1.178 


1.742 


0.832 


4 


43 


ADL4 


0.585 


3.123 


0.479 


1 


5 


CC5 


0.989 


1.347 


0.699 


4 


44 


ADL5 


1.018 


0.872 


0.834 


1 


6 


CC6 


1.098 


1.666 


0.775 


4 


45 


ADL6 


0.817 


1.082 


0.669 


1 


7 


CC7 


1.280 


0.283 


0.904 


4 


46 


ADL7 


0.975 


-1.303 


0.799 


1 


8 


CC8 


0.938 


1.134 


0.663 


4 


47 


ADL8 


1.221 


1.758 


1.000 


1 


9 


CC9 


0.928 


2.093 


0.655 


4 


48 


ADL9 


0.732 


-0.051 


0.600 


1 


10 


CC10 


0.805 


2. 154 


0.568 


4 


49 


ADL10 


1.155 


-2.084 


0.946 


1 


11 


ecu 


0.750 


2.501 


0.530 


4 


50 


ADL11 


0.212 


-6.810 


0.174 


1 


12 


CC12 


0.657 


3.113 


0.464 


4 


51 


ADL12 


0.641 


-0.132 


0.525 


1 


13 


CC13 


0.463 


2.355 


0.327 


4 


52 


ADL13 


0.386 


3.596 


0.316 


2 


14 


CAS1 


1.000 


0.000 


0.177 


4 


53 


ADL14 


0.337 


2.049 


0.276 


2 


15 


CAS2 


4.954 


-0.806 


0.879 


4 


54 


ADL15 


0.429 


2.626 


0.352 


2 


16 


CAS3 


3.673 


-0.883 


0.652 


4 


55 


ADL16 


0.415 


-1.089 


0.340 


2 


17 


CAS4 


5.636 


— 0.829 


1.000 


4 


56 


ADL17 


0.461 


0.418 


0.378 


2 


18 


CAS5 


1.702 


-0.602 


0.302 


4 


57 


ADL18 


0.392 


3.674 


0.321 


3 


19 


HBD1 


1.000 


0.00(1 


0.0 YU 


b 


58 


11 


1.000 


0.000 


U.U5U 


3 


20 


HBD2 


5.773 


-1.646 


0.403 


5 


59 


12 


1.013 


-0.349 


0.051 


3 


21 


HBD3 


3.630 


-1.453 


0.253 


5 


60 


13 


0.086 


10.000 


0.004 


3 


22 


HBD4 


2.722 


-1.459 


0.190 


5 


61 


14 


0.327 


1.181 


0.016 


3 


23 


HBD5 


0.508 


2.305 


0.035 


5 


62 


15 


0.355 


10.000 


0.018 


3 


24 


HBD6 


2.959 


-1.352 


0.206 


5 


63 


16 


0.654 


6.530 


0.033 


3 


25 


HBD7 


3.225 


-1.110 


0.225 




64 


17 


0.160 


10.000 


0.008 


3 


26 


HBD8 


1.111 


-0.626 


0.078 


5 


65 


18 


0.434 


10.000 


0.022 


3 


27 


HBD9 


1.867 


-1.208 


0.130 


5 


66 


19 


20.000 


1.914 


1.000 


3 


28 


HBDK) 


3.443 


-1.425 


0.240 


5 


67 


110 


0.586 


4.627 


0.029 


3 


29 


HBD11 


3.885 


-1.586 


0.271 


5 


68 


111 


0.541 


10.000 


0.027 


3 


30 


HBD12 


3.285 


-1.693 


0.229 


5 


69 


112 


0.421 


8.777 


0.021 


3 


31 


HBD13 


2.297 


-1.144 


0.160 


5 


70 


113 


0.605 


-1.818 


0.030 


3 


32 


HBD14 


12.159 


-1.787 


0.848 


5 


71 


114 


0.436 


3.367 


0.022 


3 


33 


HBD15 


5.465 


-1.813 


0.381 


5 


72 


115 


0.545 


10.000 


0.027 


3 


34 


HBD16 


6.356 


-1.869 


0.443 


6 


73 


Nl 


1.000 


0.000 


0.359 


3 


35 


HBD1T 


14.337 


-1.729 


1.000 


6 


74 


JV2 


2.784 


-0.142 


1.000 


3 


36 


HBD18 


3.580 


-1.494 


0.250 


6 


75 


N3 


0.975 


3.359 


0.350 


3 


37 


HBD19 


8.130 


-1.606 


0.567 


6 


76 


N4 


1.267 


1.132 


0.455 


3 


38 


HBD20 


9.393 


-1.713 


0.655 


6 


77 


N5 


0.310 


10.000 


0.111 


3 


39 


HBD21 


4.940 


-1.568 


0.345 


6 


78 


N6 


0.233 


10.000 


0.084 














6 


79 


N7 


0.289 


10.000 


0.104 














6 


80 


N8 


1.350 


0.365 


0.485 














7 


81 


L)l 


l.OUO 


o.uoo 


I.UOO 














7 


82 


D2 


0.050 


8.553 


0.050 














7 


83 


D3 


0.583 


-2.037 


0.583 














7 


84 


D4 


0.201 


10.000 


0.201 














7 


85 


D5 


0.710 


4.550 


0.710 














7 


86 


D6 


0.050 


6.764 


0.050 














8 


87 


SKI 


l.OUO 


O.UOO 


I.UOO 














8 


88 


SK2 


0.328 


8.156 


0.328 














8 


89 


SK3 


0.519 


1.185 


0.519 



Compared to the LC model (see Table 5), the IRT model chooses a reduced 
number of items to keep into the analysis, appearing less conservative in terms of 
the item selection process. In fact, by using a critical value of 0.5 we select 35 items, 
instead of 63 chosen by the LC based procedure. The choice of that critical value 
has been addressed to keep in the analysis a relevant number of items, without 
loosing too much information in relation to the analyzed phenomenon. 
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Table 7: Results from the item selection mechanism based on the indices D* in 
terms of number of items retained for each dimension. 



dimension 



threshold 


1 


2 


3 


4 


5 


6 


7 


8 


overall 


0.0 


13 


5 


21 


18 


15 


8 


6 


3 


89 


0.1 


13 


5 


18 


18 


1 


7 


4 


3 


69 


0.2 


13 


4 


15 


17 


1 


5 


4 


3 


62 


0.3 


13 


4 


8 


16 


1 


5 


3 


3 


53 


0.4 


12 


3 


6 


11 


1 


3 


3 


2 


41 


0.5 


11 


3 


4 


10 


1 


1 


3 


2 


35 


0.6 


9 


3 


3 


8 


1 


1 


2 


1 


28 


0.7 


6 


2 


2 


6 


1 


1 


2 


1 


21 


0.8 


4 


2 


2 


5 


1 


1 


1 


1 


17 


0.9 


2 


1 


1 


3 


1 


1 


1 


1 


11 


1.0 


1 


1 


1 


1 


1 


1 


1 


1 


8 



Once we have selected the number of items, we perform the hierarchical cluster 
analysis on the dimensionality, starting from the eight dimensions defined by the 
structure of the questionnaire. The approach that we use at this aim is described 
at the end of Section 3.2 and the results obtained from its application are reported 
in Table 8 and represented by the dendrogram in Figure 2. In particular, the table 
shows the list of the dimensions formed by the collection of the items corresponding 
to the eight initial dimensions, together with the statistic LR computed with 
respect to the model chosen at the previous step and the corresponding p-value. 

Table 8: Output of the hierarchical cluster algorithm based on the 2PL multidi- 
mensional model. 



h 


s 


clusters 


LR 


p- value 


1 


7 


{1}, {2}, {3}, {6}, {7}, {8}, {4, 5} 


0.379 


0.984 


2 


6 


{1}, {2}, {6}, {7}, {4, 5}, {3, 8} 


3.871 


0.424 


3 


5 


{1}, {2}, {4, 5}, {3, 8}, {6, 7} 


5.235 


0.264 


4 


4 


{1}, {2}, {3, 8}, {4, 5, 6, 7} 


29.045 


0.000 


5 


3 


{3, 8}, {4, 5, 7, 8}, {1,2} 


60.142 


0.000 


6 


2 


{1,2}, {3, 4, 5, 6, 7, 8} 


42.107 


0.000 


7 


1 


{1,2,3,4,5,6,7,8} 


187.440 


0.000 
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These results show evidence of five dimensions which have the following struc- 
ture: {1}, {2}, {4, 5}, {3, 8}, {6, 7}. These dimensions correspond to: (i) cognitive 
conditions, (ii) auditory and view fields, (Hi) activities of daily living and incontin- 
ence, (iv) humor and behavioral disorders and skin conditions, and (v) nutritional 
field and dental disorders. These dimensions have a clear interpretation and seems 
to confirm the robustness of the proposed analysis. 

Figure 2: Dendrogram based on the 2PL multidimensional model. The eight initial 
dimensions characterize: (i) cognitive conditions, (ii) auditory and view fields, 
(Hi) humor and behavioral disorders, (iv) activities of daily living, (v) continence, 
(vi) nutritional field, (vii) dental disorders, and (viii) skin conditions. 



250 - 



200 - 



'i 150 - 
as 

at 
a 

100 - 



50 - 



I I I I I ! ' I I I I I 

4 5 6 7 3 8 1 2 

Items 

Finally, in Table 9 we report the estimated abilities 6 c d for each latent class, 
together with estimated class weights 7r c . With reference to every dimension d, 
each parameter 9 c d corresponds to the latent trait level for the subjects in class c. 
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In the present study, high values of the parameter correspond to high probability 
to suffer from a certain pathology. 

Table 9: Estimated ability parameters 8 c d for each latent class c and dimension d, 
together with the estimated weights n c . 

latent dimension 

class I 2 3 4 5 weight 



1 


-2.516 


-2.690 


-5.179 


-5.137 


-4.815 


0.213 


2 


1.142 


-1.171 


-3.402 


-2.224 


-3.927 


0.153 


3 


-1.253 


-2.711 


-2.227 


-3.203 


1.050 


0.131 


4 


3.996 


0.702 


-1.960 


0.525 


-2.025 


0.102 


5 


2.068 


-1.667 


-1.946 


-1.973 


1.333 


0.160 


6 


4.386 


-0.451 


-0.727 


2.117 


1.787 


0.238 



For each pair of dimensions (di,d 2 ), it may be interesting to compute the 
correlation Pd lt d 2 between the estimated ability levels. Taking into account the 
class weights, we compute these correlation indices as 

X)c(#cdi — 9 dl ){6 c d 2 — d2 )iic 

Pd 1( i2 



where 6 a = J2 c ^cd^c is the average ability level for dimension d. The results are 
reported in Table 10. 



Table 10: Correlation between the estimated abilities for each pair of dimensions 

(di,d 2 ). 

1st dimension 
2nd dimension I 2 3 4 5 



1 


1.000 








2 


0.912 


1.000 






3 


0.866 


0.648 


1.000 




4 


0.964 


0.863 


0.898 


1.000 


5 


0.611 


0.287 


0.905 


0.661 1.000 
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Note that, apart from three pairs of dimensions for which these correlations 
are particularly high (1st and 2nd, 1st and 4th, and 3rd and 5th), the other 
correlations are smaller than 0.9. In particular, the correlation between the 2nd 
and the 5th dimensions is smaller than 0.3; moreover, the correlation is smaller 
than 0.7 for three other pairs of dimensions (1st and 5th, 2nd and 3rd, and 4th 
and 5th). This confirms that the dimensions found by the clustering algorithm 
are actually distinct and then measuring the health condition of elderly people 
necessarily requires a multidimensional scale. 

5 Conclusions 

In the present paper we simultaneously study the issue of item selection and of 
dimensionality of health status of elderly people hosted in nursing homes. Our 
statistical approach is based on the joint use of latent class (LC) and item response 
theory (IRT) models. 

The study is based on a dataset collected in Italy within the "Ulisse" project, 
which relies on a sample of 1699 elderly people hosted in 37 nursing homes. The 
health status of these patients is assessed by a set of items which are administered 
at repeated occasions. From the original dataset, we extract 89 items, which 
characterize different areas of the health status at the first visit. In particular, we 
consider eight groups of items (each corresponding to a different dimension), which 
measure: (i) cognitive conditions, (it) auditory and view fields, (Hi) humor and 
behavioral disorders, (iv) activities of daily living, (v) incontinence, (vi) nutritional 
field, (vii) dental disorders, and (viii) skin conditions. 

The analysis initially exploits the LC model for selecting a subset of items 
which provides an amount of information close to that of the full set of items. 
In particular, through the Bayesian Information Criterion we find evidence of the 
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presence of six latent classes. Then, through the estimated conditional probabil- 
ities of the LC model we start ranking the items according to their discriminant 
power. This is measured by the standardized difference between the estimated con- 
ditional probabilities across latent classes. However, this selection process appears 
too conservative. 

The items selection analysis is then completed by using an IRT model based on 
a multidimensional 2PL parametrization. In particular, the applied strategy first 
selects a benchmark model which has a number of dimensions equal to the eight 
initial dimensions defined by the questionnaire and then, by applying the 2PL 
model, selects the items depending on their discriminatory power in measuring 
the latent trait. The subset of selected items is then used to study the dimension- 
ality of the health condition of elderly people. A this aim, we apply a hierarchical 
clustering algorithm which, starting from the multidimensional model with eight 
dimensions, ends with the unidimensional model. Within these two extremes we 
find all the possible numbers of dimensions of the analyzed phenomenon and select 
the most suitable by a series of likelihood ratio tests. On the basis of this procedure 
we find evidence of five dimensions obtained by collapsing the initial eight dimen- 
sions. These groups have the following structure: {1}, {2}, {4, 5}, {3, 8}, {6, 7}. 
These five dimensions may be referred to as: (i) cognitive conditions, (ii) aud- 
itory and view fields, (Hi) activities of daily living and incontinence, (iv) humor 
and behavioral disorders and skin conditions, and (v) nutritional field and dental 
disorders. 

Finally, the applied methodology suggests that the dimensionality of health 
status of elderly people is a relevant aspect to be considered in order to obtain a 
clear classification of the nursing home facilities in the Italian context. Moreover, 
the IRT analysis shows that the identified dimensions have not the same discrim- 
inating power in determining the health status. 
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Appendix 1: Description of the selected Items 



n item 


Our 

class 


Ulissc 
class 


item description 


01 


CC1 


c2a 


recalls what recently happened (5 minutes) 


02 


CC2 


c2b 


keeps some past memories green 


03 


CC3 


e3a 


recalls the actual season 


(14 


CC4 


e3b 


recalls where is him room 


05 


CC5 


e3c 


recalls the names and faces of the staff 


06 


CC6 


e3d 


recalls where he is 


07 


CC7 


c4 


decides about his daily activities 


08 


CC8 


e5a 


gets easily sidetracked 


09 


CC9 


c5b 


shows episodes of altered perception or awareness of surrounding 


10 


CC10 


c5c 


shows episodes of disorganized speech 


11 


ecu 


c5d 


has periods of restlessness movements 


12 


CC12 


e5e 


shows lethargic spans 


13 


CC13 


e5f 


does its cognitive conditions change during the day 


14 


CAS1 


fl 


shows hearing deficiency 


15 


CAS2 


f4 


making self-understood 


16 


CAS3 


f5 


has a clear language 


17 


CAS4 


f6 


is capable of understand others 


18 


CASS 


gl 


is able to sec in conditions of adequate lighting 


19 


HBD1 


hla 


made negative statements 


20 


HBD2 


hlb 


made repetitive questions 


21 


HBD3 


hlc 


made repetitive verbalizations 


22 


HBD4 


hid 


shows persistent anger with self or others 


23 


HBD5 


hie 


shows self deprecation discstccm 


24 


HBD6 


hlf 


X j_ i 

expresses iearr- arc not real 


25 


HBD7 


hlg 


believes itself to be dying 


26 


HBD8 


hlh 


complains about his health 


27 


HBD9 


hli 


unpleasant mood in morning 


^® 


ttSRi ? 


Vi "I i 


has problems with sleep 


29 


HBU11 


hlk 


suffers from insomnia 


30 


HBD12 


hll 


has expressions of sad-faced 


31 


HBD13 


him 


easily tears 


32 


HBD14 


hln 


shows repetitive movements 


33 


HBD15 


hlo 


abstains from activities of interest 


34 


HBD16 


hip 


shows reduced local interactions 


35 


HBD17 


h4al 


wanders aimlessly 


36 


WRrJi a 




uses offensive language 


37 




\*A 1 
htr\ 1 


is physically aggressive 




HdUJU 




has a socially inappropriate behavior 






n4el 


refuses assistance 


40 


ADL1 


jlal 


needs support in moving to/from lying position 


41 


ADL2 


jlbl 


needs support in moving to/from bed, chair, wheelchair 


42 


ADL3 


jlcl 


walks between different points within the room 








walks in the corridor 


44 


ADL5 


"i 1 n~\ 
J 1L1 




45 


ADL6 


il fl 


walks outs id^thc^iir^ing^om^ward 


46 


ADL7 


jlgl 


needs support for dressing 


47 


ADL8 


jlhl 


needs support for eating 


48 


ADL9 


jlil 


needs support using the toilet room 


49 


ADLK) 


jljl 


needs support for personal hygiene 


50 


ADL11 


j2al 


needs support for taking full-body bath/shower 


51 


ADL12 


j3al 


shows balance problems 


52 


ADL13 


j4al 


shows loss of mobility in the neck 


53 


ADL14 


j4bl 


shows loss of mobility in the arm including shoulder or elbow 


54 


ADL15 


j4cl 


shows limitations in the movements of the hand including wrist or finger 


55 


ADL16 


j4dl 


shows loss of mobility of the leg and hip 


56 


ADL17 


j4el 


shows loss of mobility of the foot and ankle 


57 


ADL18 


j4fl 


shows limitations in other movements 


58 


11 


kla 


fecal incontinence 


59 


12 


klb 


urinary incontinence 


60 


13 


k2a 


elimination of faeces (constipation) 


61 


14 


k2b 


stipsi 


62 


15 


k2c 


diarrhea 


63 


16 


k2d 


fccaloma 


64 


17 


k3b 


need aids (bladder retraining) 


65 


18 


k3c 


need aids (external catheter) 


66 


19 


k3d 


need aids (indwelling catheter) 


67 


110 


k3e 


need aids (intermittent catheter) 


68 


111 


k3f 


did not use toilet room/commodc/urina 


69 


112 


k3g 


pads/briefs used 


70 


113 


k3h 


encmos/irrigations used 


71 


114 


k3i 


ostomy present 


72 


115 


k3j 


need aids (others) 


73 


Nl 


nla 


chewing problem 


74 


N2 


nib 


swallowing problem 


75 


N3 


nlc 


mouth pain 


76 


N4 


n3a 


lose weight 


77 


N5 


n3b 


gain weight 


78 


N6 


n4a 


complain about the taste of many foods 


79 


N7 


n4b 


complains of being hungry 


80 


N8 


n4c 


leave the food on his plate 


81 


Dl 


ola 


debris (soft, easily movable substances) present in mouth prior to going to bed at night 


82 


D2 


olb 


has dentures or removable bridge 


83 


D3 


ole 


some/all natural teeth lostdocs not have or does not use dentures (or partial plates) 


84 


D4 


old 


broken, loose, or carious teeth 

Inflamed gums (gingiva); swollen or bleeding gums; oral abscesses; ulcers or rashes 


85 


D5 


ole 


86 


D6 


olf 


daily cleaning of teeth/dentures or daily mouth carcby resident or staff 


87 


SKI 


p2a 


pressure ulcer 


88 


SK2 


p2b 


stasis ulcers 


89 


SK3 


p3 


had an ulcer that was resolved or cured 



24 



